Bulk Synchronous Parallel Scheduling of Uniform Dags

نویسنده

  • Radu Calinescu
چکیده

This paper addresses the dag scheduling problem, proposing the bulk synchronous parallel (BSP) model as a framework for the derivation of general purpose parallel computer schedules of uniform dags, i.e., of dags that stand for tightly-nested loops with computable distance vectors. A general technique for the BSP scheduling of normalised uniform dags is introduced and analysed in terms of the BSP cost model, and methods for the normalisation of generic uniform dags are brieey overviewed in the paper.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Automatic Loop Parallelization in the Bsp Model

This paper introduces a new scheme for the scheduling of generic, untightly nested loops on distributed-memory systems. Being targeted at the bulk-synchronous parallel (BSP) model of computation , the novel parallelization scheme yields parallel code which is scalable, portable, and whose performance can be analytically evaluated.

متن کامل

Managing irregular remote accesses to distributed shared arrays in a bulk-synchronous parallel programming environment

NestStep is a parallel programming language for the BSP (bulk-synchronous parallel) programming model. In this paper we describe the concept of distributed shared arrays in NestStep and its implementation on top of MPI. In particular, we describe a novel method for runtime scheduling of irregular, direct remote accesses to sections of distributed shared arrays. Our method, which is fully parall...

متن کامل

Non-approximability of the Bulk Synchronous Task Scheduling Problem

The mainstream architecture of a parallel machine with more than tens of processors is a distributed-memory machine. The bulk synchronous task scheduling problem (BSSP, for short) is an task scheduling problem for distributed-memory machines. This paper shows that there does not exist a ρ-approximation algorithm to solve the optimization counterpart of BSSP for any ρ < 6 5 unless P = NP .

متن کامل

Scientific Computing on Bulk Synchronous Parallel Architectures

Bulk synchronous parallel architectures offer the prospect of achieving both scalable parallel performance and architecture independent parallel software. They provide a robust model on which to base the future development of general purpose parallel computing systems. In this paper we theoretically and experimentally analyse the efficiency with which a wide range of important scientific comput...

متن کامل

Scheduling User-Level Threads on Distributed Shared-Memory Multiprocessors

In this paper we present Dynamic Bisectioning or DBS, a simple but powerful comprehensive scheduling policy for user-level threads, which unifies the exploitation of (multidimensional) loop and nested functional (or task) parallelism. Unlike other schemes that have been proposed and used thus far, DBS is not constrained to scheduling DAGs or singly nested parallel loops. Rather, our policy enco...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1996